Classification of Tumor Samples from Expression Data Using Decision Trunks

نویسندگان

  • Benjamin Ulfenborg
  • Karin Klinga-Levan
  • Björn Olsson
چکیده

We present a novel machine learning approach for the classification of cancer samples using expression data. We refer to the method as "decision trunks," since it is loosely based on decision trees, but contains several modifications designed to achieve an algorithm that: (1) produces smaller and more easily interpretable classifiers than decision trees; (2) is more robust in varying application scenarios; and (3) achieves higher classification accuracy. The decision trunk algorithm has been implemented and tested on 26 classification tasks, covering a wide range of cancer forms, experimental methods, and classification scenarios. This comprehensive evaluation indicates that the proposed algorithm performs at least as well as the current state of the art algorithms in terms of accuracy, while producing classifiers that include on average only 2-3 markers. We suggest that the resulting decision trunks have clear advantages over other classifiers due to their transparency, interpretability, and their correspondence with human decision-making and clinical testing practices.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Feature Selection and Classification of Microarray Gene Expression Data of Ovarian Carcinoma Patients using Weighted Voting Support Vector Machine

We can reach by DNA microarray gene expression to such wealth of information with thousands of variables (genes). Analysis of this information can show genetic reasons of disease and tumor differences. In this study we try to reduce high-dimensional data by statistical method to select valuable genes with high impact as biomarkers and then classify ovarian tumor based on gene expression data of...

متن کامل

Evaluation of PRR11 gene expression changes and its relationship with tumor size in patients with gastric adenocarcinoma

Introduction: Gastric cancer is one of the most common gastrointestinal tract neoplasms. Because of its invasion, and nonspecific symptoms and signs, the disease is often diagnosed at an advanced stage with short survival. PRR11 participates in the initiation and progression of lung cancer and breast cancer by regulating important genes involved in cell cycles and tumorigenesis. In this researc...

متن کامل

بررسی بیان پروتئین های p53 و HER2 با عوامل مرتبط با پیش آگهی کارسینوم معده به روش ایمونوهیستوشیمی

Background and purpose: Gastric carcinoma is one of the most common malignancies and its early diagnosis can be effective in the treatment method and the rate of the patients’ survival. Ïn the past recent years, many genes have been identified which may have relationship with the prognosis of this disease and the patient’s survival. This study is going to investigate the rate and pattern of p53...

متن کامل

Diagnosis of brain tumor using image processing and determination of its type with RVM neural networks

Typically, the diagnosis of a tumor is done through surgical sampling, which is more precise with existing methods. The difference is that this is an aggressive, time consuming and expensive way. In the statistical method, due to the complexity of the brain tissues and the similarity between the cancerous cells and the natural tissues, even a radiologist or an expert physician may also be in er...

متن کامل

Using DEA for Classification in Credit Scoring

Credit scoring is a kind of binary classification problem that contains important information for manager to make a decision in particularly in banking authorities. Obtained scores provide a practical credit decision for a loan officer to classify clients to reject or accept for payment loan. For this sake, in this paper a data envelopment analysis- discriminant analysis (DEA-DA) approach is us...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 12  شماره 

صفحات  -

تاریخ انتشار 2013